Investigating the Robustness of Teager Energy Cepstrum Coefficients for Emotion Recognition in Noisy Conditions
نویسندگان
چکیده
This paper investigated the robustness of Teager Energy Cepstrum Coefficient (TECC) in differentiating emotion categories for speech at different White Gaussian noise levels by comparing the performance with MFCC. Experiments involved the normalized squared error measurement, the multi-classes (four classes) emotion classification and the pair-wise emotion classification. This study included four emotion categories (neutral, happy, sad, and happy) from three databases (two English, one German). The result showed that TECC performed equally or outperformed MFCC in both multiemotion and pair-wise emotion classifications at all noise levels for all three databases. Using TECC features only, up to 89% for the four-emotion classification and 99% for the pair-wise emotion classification accuracy rate could be achieved.
منابع مشابه
Auditory Teager energy cepstrum coefficients for robust speech recognition
In this paper, a feature extraction algorithm for robust speech recognition is introduced. The feature extraction algorithm is motivated by the human auditory processing and the nonlinear Teager-Kaiser energy operator that estimates the true energy of the source of a resonance. The proposed features are labeled as Teager Energy Cepstrum Coefficients (TECCs). TECCs are computed by first filterin...
متن کاملSpeech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
متن کاملRobustness of Auditory Teager Energy Cepstrum Coefficients for Classification of Pathological and Normal Voices in Noisy Environments
This paper focuses on a robust feature extraction algorithm for automatic classification of pathological and normal voices in noisy environments. The proposed algorithm is based on human auditory processing and the nonlinear Teager-Kaiser energy operator. The robust features which labeled Teager Energy Cepstrum Coefficients (TECCs) are computed in three steps. Firstly, each speech signal frame ...
متن کاملNoise Suppression Based on Teager Energy Operator for Improving the Robustness of Asr Front-end
In this paper, we proposed a new noise suppression method based on Teager Energy Operator in advancing the noise robustness of speech recognition front-end. The presented method attempts to remove a distortion estimation in Teager energy domain, especially, a Teager energy estimation of noise signal is subtracted from the noisy speech signal. This approach differs significantly from the traditi...
متن کاملRecognition and Classification of Human Emotion from Audio
In this paper, the audio emotion recognition system is proposed that uses a mixture of rule-based and machine learning techniques to improve the recognition efficacy in the audio paths. The audio path is designed using a combination of input prosodic features (pitch, log-energy, zero crossing rates and Teager energy operator) and spectral features (Mel-scale frequency cepstral coefficients). Me...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012